SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: WEI, Ying-Fei 

(ii) TITLE OF INVENTION: Transforming Growth Factor 

Alpha HI I I 

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Carella, Byrne, Bain, Gilfillan, 

Cecchi, Stewart & Olstein 

(B) STREET: 6 Becker Farm Road 

(C) CITY: Roseland 

(D) STATE: NJ 

(E) COUNTRY: USA 

(F) ZIP: 07068-1739 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: WordPerfect 5 - 1 , Dos Text File 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/778,545 

(B) FILING DATE: January 3, 1997 ^ 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: 60/011,136 

(B) FILING DATE: January 4, 1996 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: MULL INS , J.G. 

(B) REGISTRATION NUMBER: 33,073 

(C) REFERENCE/DOCKET NUMBER: 325800-541 (PF 220) 

(ix) TELECOMMUNICATION. INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GAAA ATG GCG CCT CAC GGC CCG GGT AGT CTT ACQ ACC CTG GTG CCC TGG 4 9 

Met Ala Pro His Gly Pro" Gly Ser Leu Thr Thr Leu Val Pro Trp 
-25 -20 -15 



GCT GCC GCC CTG CTC CTC GCT CTG GGC GTG GAA AGG GCT CTG GCG CTA 97 
Ala Ala Ala Leu Leu Leu Ala Leu Gly Val Glu Arg Ala Leu Ala Leu 

^10 -5 15. 



CCC GAG ATA TGC ACC CAA TGT CCA GGG AGC GTG CAA AAT TTG TCA AAA 145 
Pro Glu He Cys Thr Gin Cys Pro Gly Ser Val Gin Asn Leu Ser Lys 
10 15 20 

GTG GCC TTT TAT TGT AAA ACG ACA CGA GAG CTA ATG CTG CAT GCC CGT 193 
Val Ala Phe Tyr Cys Lys Thr Thr Arg Glu Leu Met Leu His Ala Arg 
25 30 35 

TGC TGC CTG AAT CAG AAG GGC ACC ATC TTG GGG CTG GAT CTC CAG AAC 241 
CVS Cys Leu Asn Gin Lys Gly Thr He Leu Gly Leu Asp Leu Gin Asn 
40 45 50 



TGT TCT CTG GAG GAC CCT GGT CCA AAC TTT CAT CAG GCA CAT ACC ACT 289 
Cys Ser Leu Glu Asp Pro Gly Pro Asn Phe His Gin Ala Hxs Thr Thr 
55 60 65 70 

GTC ATC ATA GAC CTG CAA GCA AAC CCC CTC AAA GGT GAC TTG GCC AAC 337 
Val He He Asp Leu Gin Ala Asn Pro Leu Lys Gly Asp Leu Ala Asn 
75 80 85 

ACC TTC CGT GGC TTT ACT CAG CTC CAG ACT CTG ATA CTG CCA CAA CAT 3 85 

Thr Phe Arg Gly Phe Thr Gin Leu Gin Thr Leu He Leu Pro Gin His 
90 95 100 

GTC AAC TGT CCT GGA GGA ATT AAT GCC TGG AAT ACT ATC ACC TCT TAT 433 
Val Asn Cys Pro Gly Gly He Asn Ala Trp Asn Thr He Thr Ser Tyr 
105 110 

ATA GAC AAC CAA ATC TGT CAA GGG CAA AAG AAC CTT TGC AAT AAC ACT 481 
He Asp Asn Gin He Cys Gin Gly Gin Lys Asn Leu Cys Asn Asn Thr 
120 125 130 

GGG GAC CCA GAA ATG' TGT CCT GAG AAT GGA TCT TGT GTA CCT GAT GGT 529 
Gly Asp Pro Glu Met Cys Pro Glu Asn Gly Ser Cys Val Pro Asp Gly 
135 140 145 150 

CCA GGT CTT TTG CAG TGT GTT TGT GCT GAT GGT TTC CAT GGA TAC AAG 577 
Pro Gly Leu Leu Gin Cys Val Cys Ala Asp Gly Phe His Gly Tyr Lys 
155 160 165 

TGT ATG CGC CAG GGC TCG TTC TCA CTG CTT ATG TTC TTC GGG ATT CTG 625 
Cvs Met Arg Gin Gly Ser Phe Ser Leu Leu Met Phe Phe Gly He Leu 
^ 170 175 180 

GGA GCC ACC ACT CTA TCC GTC TCC ATT CTG CTT TGG GCG ACC CAG CGC 673 
Gly Ala Thr Thr Leu Ser Val Ser He Leu Leu Trp Ala Thr Gin Arg 
185 190 195 

CGA ATU^ GCC AAG* ACT TCA TGAACTACAT AGGTCTTACC ATTGACCTAA 721 
Arg Lys Ala Lys Thr Ser 
200 

GATCAATCTG AACTATCTTA GCCCAGTCAG GGAGCTCTGC TTCCTAGAAA GGCATCTTTC 781 
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GCCAGTGGAT TCGCCTCAAG GTTGAGGCCG CCATTGGAAG ATGAAAAATT GCACTCCCTT 841 

GGTGTAGACA AATACCAGTT CCCATTGGTG TTGTTGCCTA TAATAAACAC TTTTTTCTTT 901 

TTTAAAAAAA AAAAAAAAAA AA 923 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ala Pro His Gly Pro Gly Ser Leu Thr thr Leu Val Pro Trp Ala 
-25 -20 -15 -10 

Ala Ala Leu Leu Leu Ala Leu Gly Val Glu Arg Ala Leu Ala Leu Pro 
-5 15 

Glu lie Cys Thr Gin Cys Pro Gly Ser Val Gin Asn Leu Ser Lys Val 
10 15 20 

Ala Phe Tyr Cys Lys Thr Thr Arg Glu Leu Met Leu His Ala Arg Cys 
25 30 35 

Cys Leu Asn Gin Lys Gly Thr lie Leu Gly Leu Asp Leu Gin Asn Cys 
40 45 50 55 

Ser Leu Glu Asp Pro Gly Pro Asn Phe His Gin Ala His Thr Thr Val 
60 65 70 

lie lie Asp Leu Gin Ala Asn Pro Leu Lys Gly Asp Leu Ala Asn Thr 
75 80 85 

Phe Arg Gly Phe Thr Gin Leu Gin Thr Leu lie Leu Pro Gin His Val 
90 • . 95 100 

Asn Cys Pro Gly -Gly lie Asn Ala Trp Asn Thr He Thr Ser Tyr He 
105 110 115 

Asp Asn Gin He Cys Gin Gly Gin Lys Asn Leu Cys Asn Asn Thr Gly 
120 125 130 135 

Asp Pro Glu Met Cys Pro Glu Asn Gly Ser Cys Val Pro Asp Gly Pro 
140 145 150 

Gly Leu Leu Gin Cys Val Cys Ala Asp Gly Phe His Gly Tyr Lys Cys 
155 160 165 

Met Arg Gin Gly Ser Phe Ser Leu Leu Met Phe Phe Gly He Leu Gly 
170 175 180 

Ala Thr Thr Leu Ser Val Ser He Leu Leu Trp Ala Thr Gin Arg Arg 
185 190 195 

Lys Ala Lys Thr Ser 
200. 
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(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CGCGGATCCG GGCAAAAGAA CCTTTGC 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GCGTCTAGAC TAAAGCAGTG AGAACGAGCC 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS-: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CGCGGATCCG TCCATCATGG CGCCTCACGG CCCG 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6. 
GCGTCTAGAC TACATAAGCA GTGAGAACGA GCC 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
CGCGGATCCC GGGCAAAAGA ACCTTTGC 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

• (ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 
GCGTCTAGAC TACATAAGCA GTGAGAACGA GCC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
. (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 
GCGCTCAGAC ATAAGCAGTG AGAACGAGCC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gin Gly Leu Asn Lys Cys Asn Asn Thr Gly Asp Pro Glu Met Cys Pro 
15 10 15 

Glu Asn Gly Ser Cys Val Pro Asp Gly Pro Gly Leu Leu Gin Cys Val 
20 25 30 

Cys Ala Asp Gly Phe His Gly Tyr Lys Cys Met Arg Gin Gly Ser Phe 
35 40 45 

Ser Leu Leu Met 
50 



